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1 Purification Means 

2 

3 The present invention relates to purification means , 

4 in particular to means suitable for use in 

5 purification of soluble proteins. 
6 

7 Introduction 
8 

9 The recombinant production of protein in bacteria, 

10 yeast, insect and mammalian cell lines has become a 

11 cornerstone of biological research and the 

12 biotechnology industry. Classical biochemical and 

13 chroma tographical purification techniques usually 

14 produce inadequate amounts of a target protein to 

15 study its roles or actions. Even if enough of the 

16 protein can be purified, it usually involves 

17 cumbersome amounts of starting material or tissue 

18 and many processing steps are taken before 

19 reasonable purification can be achieved. 
20 

21 Recombinant expression of the target protein 

22 bypasses a lot of these problems. By introducing 



2 



1 the target protein's gene template to a cell line or 

2 bacterial culture, induced overexpression can result 

3 in significant levels of that protein being 

4 produced. Large amounts of protein make the 

5 purification a lot simpler, but the addition or 

6 fusion of purification domains or tags allows for a 

7 relatively simple one-step purification using 

8 affinity chromatography resins. However, 

9 occasionally, due to the varying nature of proteins, 

10 the production of soluble protein has remained • 

11 elusive with known tags unable to purify many 

12 proteins. In some cases, production of protein can 

13 be a problem due to differences in the machinery of 

14 bacterial cells. There is, therefore a need for a 

15 more versatile tag than is available currently on 

16 the market . The provision of such a versatile tag 

17 enabling , for example, improved ability to quickly 

18 produce and screen soluble protein in bacteria such 

19 as B.aoli would represent a major step forward in 

20 protein biochemistry. 
21 

22 Summary of the Invention 
23 

24 The present inventors have developed a novel 

25 purification tag based on the gene product of a 

26 sortase gene, in particular the srtA gene of 

27 Staphylococcus aureus. This tag, known as SMUT 

2a [Solubility enhancing Unique Tag] has been found to 

29 have exceptional activity, enabling the efficient 

30 purification of soluble domains of a number of 

31 proteins hitherto not able to be isolated 

32 efficiently using conventional purification tags. 



1 

2 Throughout this specification, reference to a SNDT 

3 Tag should be understood to mean a tag derived from 

4 a sortase gene product. 
5 

6 In a first aspect of the invention, there is 

7 provided a purification tag comprising a sortase, 

8 e.g srtA, gene product. 
9 

10 In preferred embodiments, the sortase gene product 

11 is a gene product of the srtA gene of Staphylococcus 

12 aureus • 

14 Also provided is the use of a sortase, e.g srtA, 

15 gene product as a p;urif ication tag. 
16 

17 Furthermore, according to a third aspect of the 

18 invention, there is provided an expression construct 

19 for the production of recombinant polypeptides, 

20 which construct comprises an expression cassette 

21 consisting of the following elements that are 

22 operably linked: a) a promoter? b) the coding region 

23 of a DMA encoding a sortase, eg srtA gene product as 

24 a purification tag sequence; c) a cloning site for 

25 receiving the coding region for the recombinant 

26 polypeptide to be produced; and d) transcription 

27 termination signals. 
28 

29 According to a fourth aspect of the invention, there 

30 is provided a method for producing a polypeptide, 

31 comprising: a) preparing an expression vector for 

32 the polypeptide to be produced by cloning the coding 
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1 sequence for the polypeptide into the cloning site 

2 of an expression construct according to the third 

3 aspect of the invention,- b) transforming a suitable 

4 host cell with the expression construct thus 

5 obtained; and c) cul curing the host cell under 

6 conditions allowing expression of a fusion 

7 polypeptide consisting of the amino acid sequence of 

8 the purification tag with the amino acid sequence of 

9 the polypeptide to be expressed covalently linked 

10 thereto; and, optionally, d) isolating the fusion 

11 polypeptide from the host cell or the culture medium 

12 by means of binding the fusion polypeptide present 

13 therein through the amino acid sequence of the 

14 purification tag. 

15 • .:. •. , . ' ..• 

16 The expression construct, herein referred to as 

17 pSNUT, may be made by modification of any suitable 

18 vector to include the coding region of a DNA 

19 encoding a sortase. In preferred embodiments, the 

20 expression construct is based on the pQB3 0 plasmid. 
21 

22 A sample of pSNDT was deposited with the National 

23 Collections of Industrial and Marine Bacteria Ltd. 

24 (NCIMB) , 23 St Machar Drive, Aberdeen, Scotland AB24 

25 3RY on 23 December 2002 under accession no NCIMB 

26 41153. 
27 

28 In a fifth aspect, there is provided a fusion 

29 polypeptide obtained by the method of the fourth 

30 aspect of the invention. 
31 
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1 The inventors have f ound that when a fusion 

2 polypeptide comprising a polypeptide/protein of 

3 interest and a SNUT tag is used as an antigen, the 

4 immune response generated is significantly stronger 

5 than that generated when the polypeptide/protein of 

6 interest alone is used as the antigen. 
7 

8 Thus, in a sixth aspect of the present invention, 

9 there is provided a method of inducing and/or 

10 enhancing an immune response to an antigen of 

11 interest, the method comprising administering the 

12 antigen of interest with a sortase, e.g srtA, gene 

13 product. The antigen of interest, which preferably 

14 is a polypeptide/protein of interest, may be 

15 administered simultaneously, separately or 

16 sequentially with. the sortase, e.g srtA, gene 

17 product. In preferred embodiments, the antigen of 

18 interest is linked to the sortase, e.g srtA, gene 

19 product, preferably in the form of a fusion 

20 polypeptide. 
21 

22 In a seventh aspect of the invention, there is 

23 provided the use of a sortase, e.g srtA, gene 

24 product as an immunogen. As with the sixth aspect, 

25 the sortase, e.g srtA, gene product is preferably 

26 administered as a fusion polypeptide comprising the 
2 7 sortase, e.g srtA, gene product and an antigen of 
28 interest. 

29 

30 In preferred embodiments, the sortase, e*g. srtA 

31 gene product (SNUT) is encoded by the nucleotide 

32 sequence shown in Figure 4 or a variant or fragment 
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20 
21 



23 
24 
25 



thereof. Preferably, the srtA gene product 
comprises amino acids 26 to 171 of the SrtA sequence 
shown in Figure 4 or a variant or fragment thereof. 



Variants and fragments of and for use in the 
invention preferably retain the functional 
capability of the polypeptide i.e. ability to be 

8 used as a purification tag. Such variants and 

9 fragments which retain the function of the natural 
polypeptides, can be prepared according to methods 

11 for altering polypeptide sequence known to one of 

12 ordinary skill in the art such as are found in 

13 references which compile such methods, e.g. 

14 Molecular Cloning: A Laboratory Manual, j. Sambrook, 

15 et al., eds., Second Edition, Cold Spring Harbor 

16 Laboratory Press, Cold Spring Harbor, New York, 

17 1989, or Current Protocols in Molecular Biology, P. 

18 M. Ausubel, et al., eds., John Wiley & Sons, inc., 

19 New York. 



A variant nucleic acid molecule shares homology 
22 with, or is identical to, all or part of the coding 
sequence discussed above. Generally, variants may 
encode, or be used to isolate or amplify nucleic 
acids which encode, polypeptides which are capable 
26 of ability to be used as a purification tag. 
27 

28 Variants of the present invention can be artificial 

29 nucleic acids (i.e. containing sequences which have 

30 not originated naturally) which can be prepared by 

31 the skilled person in the light of the present 

32 disclosure. Alternatively they way be novel, 
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1 naturally occurring, nucleic acids, which may be 

2 isolatable using the sequences of the present . 

3 invention. Thus a variant may be a distinctive part 

4 or fragment (however produced) corresponding to a 

5 portion of the sequence provided in Figure 4. The 

6 fragments may encode particular functional parts of 

7 the polypeptide. 
8 

9 The fragments may have utility in probing for, or 

10 ■ amplifying , the sequence provided or closely related 

11 ones. 
12 

13 Sequence variants which occur naturally may include 

14 alleles or other homologies (which, may include 

15 polymorphisms or mutations at one or more bases) . 

16 Artificial variants (derivatives) may be prepared by 

17 -those skilled in the art, for* instance by site 

18 directed or random mutagenesis , or by direct 

19 synthesis. Preferably the variant nucleic acid is 

20 generated either directly or indirectly (e. g. via 

21 one or amplification or replication steps) from an 

22 original nucleic acid having all or part of the 

23 sequences of Figure 4. Preferably it encodes a 

24 polypeptide which can be used a s a purification 

25 tag. 
26 

27 The term 'variant 1 nucleic acid as used herein 

28 encompasses all of these possibilities. When vised in 

29 the context of polypeptides or proteins it indicates 
3 0 the encoded expression product of the variant 

31 nucleic acid. 
32 
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8 

1 Homology (i. e. similarity or identity) may be as 

2 defined using sequence comparisons are made using 

3 FASTA and FASTP (see Pearson & Lipman, 1988. Methods 

4 in Enzymology 183 s 6398) . Parameters are preferably 

5 set, using the default matrix, as follows : 

6 Gapopen (penalty for the first residue in a gap) t - 

7 12 for proteins/ -16 for DMA 

8 Gapext (penalty for additional residues in a gap) s - 

9 2 for proteins/ -4 for DNA 
KTOP word length : 2 for proteins/6 for DNA. 

11 Homology may be at the nucleotide sequence and/or 

12 encoded amino acid sequence level. Preferably, the 

13 nucleic acid and/or amino acid sequence shares at 

14 least about 60%, or 70%, or 80% homology, most 

15 preferably at least about 90%, 95%, 96%, 97%, 98% or 

16 99% homology with the sequence shown in Figure 4. 
17 

18 Thus a variant polypeptide in accordance with the 

19 present invention may include within the sequence 
shown in Figure 4, a single amino acid or 2, 3, 4, 
5, 6, 7, 8, or 9 changes, about 10, 15, 20, 30, 40 
or 50 changes. In addition to one or more changes 

23 within the amino acid sequence shown, a variant 

24 polypeptide may include additional amino acids at 

25 the C terminus, and/or H- terminus . 
26 

27 Naturally, regarding nucleic acid variants, changes 

28 to the nucleic acid which make no difference to the 

29 encoded polypeptide (i.e.* degeneratively 

30 equivalent') are included within the scope of the 

31 present invention. 
32 
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1 Preferred, variants include one or more of the 

2 following changes (using the annotation of AF162687) : 

3 nucleotide 604 AAG causing an amino acid mutation of 

4 KAR; nucleotide 647 AAG, codon remains K, therefore 

5 a silent mutation; nucleotide 982 GAA causing an 

6 amino acid mutation of GAE. 
7 

8 Changes to a sequence, to produce a derivative, may 

9 be by one or more of addition, insertion, deletion 

10 or substitution of one or more nucleotides in the 

11 nucleic acid, leading to the addition, insertion, 

12 deletion or substitution of one or more amino acids 

13 in the encoded polypeptide. Changes may be by way of 

14 conservative variation, i- e. substitution of one 

15 hydrophobic residue such as isoleucine, valine, 

16 leucine or methionine for another, or the 

17 substitution of one polar residue for another, such 

18 as arginine for lysine, glutamic for aspartic acid, 

19 or glutamine for asparagine. As is well known to 

20 those skilled in the art, altering the primary 

21 structure of a polypeptide by a conservative 

22 substitution may not significantly alter the 

23 activity of that peptide because the side -chain of 

24 the amino acid which is inserted into the sequence 

25 may be able to form similar bonds and contacts as 

26 the side chain of the amino acid which has been 

27 substituted out. This is so even when the 

2 8 substitution is in a region which is critical in 

29 determining the peptides conformation. 

30 

31 Also included are variants having non-conservative 

32 substitutions. As is well known to those skilled in 
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the art, substitutions to regions of a peptide which 
are not critical in determining its conformation may 
not greatly affect its activity because they do not 
greatly alter the peptide's three dimensional 
5 structure . 
6 

In regions which are critical in determining the 
peptides conformation or activity such changes may 
confer advantageous properties on the polypeptide. 
10 indeed, changes such as those described above may 
confer slightly advantageous properties on the 
peptide e. g. altered stability or specificity. 



11 
12 
13 

14 SHUT tags and vectors may be used in methods of 

15 purifying a soluble domain of a peptide. 
Accordingly in a further aspect of the invention, 
there is provided a method of producing a soluble 

18 bioactive domain of a protein, the method 

comprising the steps of cloning DNA encoding at 
least one candidate soluble domain into at least one 
expression vector, transf ecting or transforming a 
host cell with said vector, expressing said DNA in 

23 said host cell, wherein said vector encodes a 

24 sortase gene product. 
25 

26 The sortase gene product is preferably in the form 

27 of a fusion protein. 
28 

29 The method may comprise the steps of analysis of dna 

30 coding for the protein of interest to identify 

31 antigenic soluble domains, designing oligonucleotide 

32 primers to amplify DNA encoding the domain, 



16 
17 



19 
20 
21 
22 



03- 7-Z3 ; i s : s 1 jMurgitroyd and co. ;oi4i30784ui 



11 

1 amplifying DNA, cloning the DNA, optionally 

2 screening clones for correct orientation of DMA, 

3 expressing DHA in expression strains, analysing 

4 expression products for solubility, analysing 

5 products and production of soluble bioactive protein 

6 domain. 
7 

8 The method optionally comprises the step of 

9 producing a soluble bioactive protein domain of said 
10 protein of interest. 

11 

12 The methods and tags of the Invention may be used 

13 with any suitable polypeptide/protein of interest, 

14 for example for the pur if i cat ion of such 

15 polypeptides /proteins of interest. As described 

16 herein and explemplif iecL in the following examples, 

17 . the inventors have demonstrated that the methods and 

18 tags of the invention enable the efficient 

19 purification of a a large number of proteins, many 

20 of which have not been amenable to efficient 

21 isolation using conventional methods and tags. 
22 

23 In preferred embodiments of the invention, the 

24 polypeptide/protein of interest is MAR1, Jakl or 

25 CD33, or a fragment thereof. 
26 

27 In particularly preferred embodiments, the 

28 polypeptide/protein of interest is a variable domain 

29 fragment e.g. a variable domain fragment of CD33 . 
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1 Preferred features of each aspect of the invention 

2 are as for each of the other aspects mutatis 

3 mutandis. 
4 

5 The invention is exemplified with reference to the 

6 following non limiting description and the 

7 accompanying figures in which 
8 

9 Figure 1 shows selected domains for amplification 

10 from in silico analysis. Representation of a 

11 candidate protein for the expression platform, in 

12 this case Jakl (human) . Pour fragments have been 

13 chosen by analysis as depicted. 
14 

15 Figure 2 shows denaturing dot-blot analysis of 

16 expression clones of fragments of MflRi in pQE30. 
17 

18 Figure 3 shows a ribbon Diagram of Staphylaocaua 

19 aureus sortase. Ribbon diagram of the putative 

20 structure of S. aureus SrtA protein (minus its N- 

21 terminal membrane anchor) - SHUT represents the 

22 portion of this structure between the two yellow 

23 arrows as shown. The yellow ball signifies a Ca** 

24 ion, essential for the biological activity of this 

25 protein. This diagram is taken from llangovan et 

26 ml., 2001 , PNAS 98 (11) 6056 

27 (doi: 10. 1073/pnas. 101064198) 
28 

29 F i gure 4 nhown fhe WiirVl eof-Mp a r r p ifm ee and a mino 

30 acid sequence of SNUT fragment. 
31 
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{a) This is the determined sequence of SNUT. The 
fragment was cloned into pQE30 using the Banff!! site 
of this vector- When in the wanted orientation, 
insertion results in the inactivation of the 
upstream cloning site, therefore allowing any- 
subsequent cloning of target inserts with the 
downstream BartKl site (see (b) for restriction map 
of sequence) . 

Figure 5- illustrates qualitative purification 
results using the SNOT fusion tag. (a) shows the 
elution profile on SDS-PAGB of SNUT-Jakl using AKT& 
Prime native histag purification- Successful 
elution of SNUT-Jakl construct is signified by the 
white arrow. (b) shows the elution profile on SDS- 
PAGB of SNUT-MARi using AKTA Prime native histag 
purification. Successful elution is shown by the 
arrow. (c) shows the same gel stained in (b) 
western blotted and detected using poly-histidine- 
HRP antibody. This is confirmation that the eluted 
species in (b) is actually SNUT-MAR1, of expected 
molecular weight. 

Figure 6 shows a Western blot of lysates using anti- 
histag antibody. 

Figure 7a illustrates the elution profile on SDS- 
PAGE of SNOT -CD 3 3 . 

Figure 7b illustrates a Western blot of the same gel 
from Figure 7a using anti-histag antibody to detect 
the proteins. 
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1 Figure 8a illustrates a Western blot using anfci- 

2 histag antibody to detect the proteins. 
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Figure 8b illustrates a Western blot of the same gel 
as Figure 8b using anti-SrtA antibody to detect the 



6 proteins . 
7 

8 Template analysis and primer design 
9 
10 



Analysis of the DNA coding for a protein of interest 
may be performed using software packages such as 

12 Vector NTI (Informax, USA) and 

13 BIASTP (http;//www.ncbi.nl m.nih.Qov/BI«AST/K p -f am t 

14 www.sano er.ac.uk/pfam) and TM pred 

15 ( wwvr. hgmp .mrc.ac.uk) which may be used to identify 

16 complete domains within the protein that 

17 significantly increase the likelihood of 
antigenicity and/or solubility when expressed as a 
subunit of the original protein coding sequence. 



In order to increase the possibility of identifying 
a soluble domain, preferably multiple sub-domains, 
more preferably at least three sub-domains, for 
example 3 to 9 sub-domains may be identified for 
processing. 



Oligonucleotide primers to amplify the selected sub- 
28 domains may be designed with the help of 

c °™nercially avialable software packages such as the 

30 internet software package Primer3 ( http;//www- 

31 genome . wi . mit . edu/aen o me software/other /p r imer 3 . html 

32 (Whitehead Institute for Biomedical Research) , 
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Vector NTI (www. iof ormaxinc , com} and DNASIS (Hitachi 
Software Engineering Company ( www.oligo.net) . 

Typically primers for use in a method of the 
invention are in the rang© 10-50 base pairs in 
length, preferably 15 to 30, for example 20 base 
pairs in length, with annealing temperatures in the 
range 45-72 *C, more conveniently 55-60*C. Primers 
may be synthesised using standard techniques or may 
be sourced from commercial suppliers such as 
invitrogen Life Technologies (Scotland) or MNG- 
Biotech AG (Germany) . 

PGR of Insert 

The desired inserts which encode the selected sub- 
domains are araplif ied using the primers designed 
specifically for that target gene using standard PCR 
techniques. The template DMA for amplification can 
be in the form of plasmid DNA, cDNA or genomic DNA, 
depending on whatever is appropriate or indeed 
available . Any suitable DNA polymerase may be used, 
for example, Platinum Taq, Pfu ( www . strat agene . com ) 
or Pfx (www.invitrogen.cora) „ . Any suitable PCR 
system may be used, for example, the Expand High 
Fidelity PCR system (Roche # Basel, Switzerland) „ 

Several different thermocycler conditions may be 
used with each set of primers. This increases the 
chance of the PCR working without having to 
individually optimise each new primer set. Typically 
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the following three programs may be used in the 
method : 



A standard PCR programme using the recommended 
annealing temperature provided with the 
primers . 

A standard PCR programme using 50°C as the 
temperature for annealing. 

A touchdown PCR programme, where the annealing 
temperature starts at a high temperature e.g 
6S°C for 10 cycles and then gradually decreases 
the annealing temperature to 50°C over the 
subsequent e.g 15 cycles. 



2. 



3. 



29 
30 
31 
32 



Buffer conditions may be adjusted as required, for 
example with respect to magnesium ion concentration 
or addition of DMSO for the amplification of 
difficult templates. Further details of a suitable 
purification method which may be used with the 
vector or tag of the invention can be found in our 
co-pending PCT application, filed on the same day as 
this application and claiming priority from GB 
0131026.7. 

The PCR products may be visualised using standard 
techniques, for example on a 1.5% agarose gel 
stained with Ethidium Bromide and the bands are cut 
out of the gel and purified using Mini elute gel 
extraction Kit (Qiagen, Crawley, England) . 

Expression Vectors 
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1 Amplified DMA inserts may be cloned into expression 

2 vectors using techniques dictated by the multiple 

3 cloning sites of the vector in question. Such 

4 techniques are readily available to the skilled 

5 person. 
6 

7 Any suitable expression system can be used in the 

8 invention. Preferably, the expression system is 

9 prokaryotic . Suitable vectors for use in the method 

10 of the invention include any vector which can encode 

11 SNUT. [Solubility enhancing Unique Tag] , for example 

12 pSNUT. This tag is based on the sequence of a trans- 

13 peptidase f ound on the surface of gram-positive 

14 bacteria. This protein is highly soluble, and 

15 expressed as very high levels . 
16 

17 The inventors have found that SNUT is an ideal 

18 fusion tag for conferring solubility and expression 

19 levels to target protein fragments. SNUT may be 

20 cloned into any suitable vector. For the purposes of 

21 the examples shown in this application, the sequence 

22 incorporating the SNUT fragment is cloned into pQE30 

23 (Qiagen, Valencia, CA) in a mariner allowing full use 

24 of the multiple cloning site (MCS) of this vector 

25 for downstream gene insertions. 
26 

27 Development of pSNUT 
28 

29 The inventors found that a tag based on the srfcA 

30 gene product from Staphylcoccua aureus is highly 

31 soluble, reacts well to purification schemes and 

32 expresses particularly well. It was hypothesised 
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1 that the incorporation of a portion or domain of 

2 this protein could represent a useful fusion tag in 
the present method, and indeed the expression of any 

4 poorly soluble protein in E. coli. Using NMR 

5 studies, the 3D structure of this protein has heen 

6 predicted and is shown in Figure 3. We hypothesised 
that by taking a portion of this structure, we could 

8 make a manipulateable protein tag, but not disturb 

9 its tertiary structure enough to reduce its highly 

10 favourable characteristics listed above. The region 

11 of this protein used as a solubility-enhancing tag 

12 is depicted by two arrows. 
13 

14 The SNOT tag was cloned into pQE30. However, it may 

15 be cloned into any suitable expression vector. 

16 Positive clones may be identified by denaturing dot 

17 blots, SDS-PAGE and Western blotting. Final 

18 confirmation of these clones was provided by DNA 

19 sequencing, and the sequence of the multiple cloning 

20 region of the resultant vector is shown in Figure 4. 
21 

22 Variances in the sequence of the SNOT domain were 

23 observed from the sequence for SrtA that has been 

24 logged in Genbank (AF162687) . The variances are 

25 {using the annotation of AF162687) nucleotide 604 

26 AAG causing an amino acid mutation of KAR; 

27 nucleotide 647 RAG, codon remains K, therefore a 
-28 ailent mutation; nucleotide 982 GAA causin g an anvi™ 

29 acid mutation of GAE. 
30 

31 Preliminary trials and native purification showed 

32 that the SNOT fragment was very soluble and its 
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1 characteristics were in no way diminished by 

2 truncation, thus showing that SNUT could represent a 

3 useful tag domain (data not shown) . As described in 

4 the Examples, to fully test the abilities of SNUT, 

5 we then chose two proteins were soluble protein 

6 production had proved impossible using conventional 

7 methods and using the other expression systems of 

8 the method of the present invention. Surprisingly, 

9 we found that, using pSNUT in the method of the 

10 invention, these proteins could be produced in 

11 soluble form. 
12 

13 Clone Propagation 
14 

15 Target insert/expression vector ligations may be 

16 propagated using standard transformation techniques 

17 including the use of chemically competent cells or 

18 electro- competent cells. The choice of the host 

19 cell and strain for transformation is dependent on 

20 the characteristics of the expression vectors being 

21 utilised. 
22 

23 Bacterial cells, for example, Escherichia, coll, are 

24 the preferred host cells. However, any suitable 

25 host cell may be used. In preferred embodiments, the 

26 host cells are EBcherchla coll. 
27 

28 The vectors may be used to each transfect or 

29 transform a plurality of different host cell 

30 strains. The set of host cell strains for 

31 individual vector may be the same or different from 

32 the set used with other vectors. 
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1 

2 In a particularly preferred embodiment of the 

3 invention, each vector may be transformed into three 

4 E. calx strains (for example, selected from 

5 Rosetta (DE3)pLad, Tuner (DE3)pLacl, Origami BL21 

6 (DE3)pLacX and TOP10F, Qiagezx) „ 
7 

8 Where the vectors are pQE based vectors, TOP10F' 

9 cells are preferred for the propagation and 

10 expression trials of such vectors. The present 

11 inventors have identified this strain as a more 

12 superior strain for these vectors than either of the 

13 recommended strains by the supplier (M15 and 

14 SG13009) , in terms of ease of use and- culture 

15 maintenance (only one antibiotic required as to two 

16 with M15 or SG13009 (www.quiagen.com) . Other F' 

17 strains such as XL1 Blue can be used, but are 

18 inferior to the, TOP10F' strain, due to lack of 

19. expression regulation (results not shown) . The use 

20 of TOP10F' (Invitrogen) for the propagation and/or 

21 expression pQE based vectors forms an independent 

22 aspect of the present invention. Other F' strains 

23 such as Xlil Blue may also be used, but are inferior 

24 to the TOP10F' • 
25 

26 After transformation, cells may be plated out onto 

27 selection plates and propagated for the development 

28 of single colonies using standard conditions. 

29 ' ; 

30 Propagation of Cells 
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1 

2 The colonies may be used to inoculate duplicate 

3 wells in a 96 well plate. 
4 

5 Typically, each well may contain 200 p.1 of LB broth 

6 with the appropriate antibiotics- Bach plate may be 

7 dedicated to one strain of JS. call or other host 

8 cell which alleviates the problems of different 

9 growth rates. The necessary controls are also 

10 included on each plate. The plates are then grown 

11 up, preferably at 37°C or any other temperature as 

12 appropriate to the particular host cell and vector, 

13 with shaking, until log phase is reached. This is 

14 the primary plate. 
15 

16 From the primary plate a secondary plate is seeded 

17 and then grown. Typically, the secondary plate is 

18 be seeded using * hedgehog' replicators and then 

19 grown up to, for example, log phase, chilled to 16°C 

20 for 1 hour. Determination of positive clones from 

21 these plates may be undertaken using functional 

22 studies. Routinely, 6-48 clones for each insert - 

23 vector ligation are taken and propagated in culture 

24 micro-titre plates containing up to 500 pi of media. 

25 According to the conditions and reagents required, 

26 protein production is then induced, and cultures 

27 propagated further. Most vectors are under the 

28 control of a promoter such as T7 , T7lac or T5, and 

29 can be easily induced with IPTG during log phase 

30 growth. Typically, cultures are propagated in a 

31 peptone -based media such as LB or 2YT supplemented 

32 with the relevant antibiotic selection marker. 
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These cultures are grown at temperatures ranging 
from 4-40 <>c, but more frequently In the range of 
20-37 *C depending on the nature of the expressed 
protein, with or without shaking and induced when 
appropriate with the inducing agent (usually log or 
early stationary phase) . After induction, growth 
propagation can be continued for 1-16 hours for a 
detectable amount of protein to be produced. 

The primary plate is preferably stored at 4°C until 
the process, is complete. 

colony screening for Inserts In Correct Orientation 

The method of the invention may include the step of 
testing transformants for correct orientation of the 
inserts. Identification of positive clones can be 
achieved through a variety of methods, including 
standard techniques such, as digestion analysis of 
plasmid DMA; colony PCR and DMA sequencing. 
Alternatively, dot -blotting may be used for the 
identification of positive clones for example, using 
a BioDot apparatus (BioRad) containing 
nitrocellulose membrane (0.45uM pore size) in 
accordance with the manufacturers' instructions, 
prior to final confirmation by DNA sequencing. 

The use of this dot blotting method in the platform 
represents a rapid, reproducible and robust ' 

deLecLiun method. TteLa ^ rticulax method xs useful 

for the rapid detection or presence of recombinant 
protein and allows for a determination of all clones 
irrespective of solubility and conformation. This 
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1 may be important at this stage, because 

2 conformational structures can inhibit the detection 

3 of tag domains if they are not presented properly on 

4 the surface of the protein. This can occur as 

5 easily with both soluble and insoluble protein. 
6 

7 As described above, standard colony PCR techniques 

8 may be used* For example, transf ormants may be 

9 selected, either manually or using automation such 

10 as the Cambridge BioRobitics BioPick instrument, and 

11 screened using directional PCR using a primer that 

12 encodes for a sequence on the vector such as S Tag 

13 or GATA sequence > and then the complementary primer 

14 from the insert, A PCR mix may be used such as the 

15 RedTaq DNA Polymerase (Sigma Aldrich, Dbrset* 

16 England) and the thermocycler conditions used may be 

17 the standard PCR programme using 50°C as the 

18 annealing temperature or adjusted as required. 
19 

20 Although all colony selecting and picking can be 

21 done manually, automated colony pickers are 

22 preferred. Automated colony pickers such a© the 

23 BioRobotics BioPick allow for the uniform and 

24 reproducible selection of clones from transformation 

25 plates. Clone selection determinants can be set to 

26 ensure picking colonies of a standardised size and 

27 shape. After picking and plate inoculation, 

28 propagation of clones can be carried out as 

29 described above. 
30 

31 Identification of positive clones can be achieved 

32 through a variety of methods, including standard 
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1 techniques such as digestion analysis of plasmid 

2 DNA; colony pgr and DMA sequencing Alternatively, in 

3 a preferred embodiment, the novel method of dot- 

4 blotting described herein for the identification of 

5 positive clones may be used in place of such 

6 traditional techniques, prior to final confirmation 

7 by DNA sequencing. The use of this method in the 

8 .platform presented here is not essential in the use 

9 of this platform over existing screening 

10 methodologies, but represents a rapid, reproducible 

11 and robust detection method. The protocol described 
here is a new protocol for an existing method for 
which commercially available equipment (Bio-Rad 

14 DotBlot) can be purchased. 
15 

15 This particular method is useful for the rapid 

17 detection or presence of recombinant protein and 

18 allows for a determination of all clones 

19 irrespective of solubility and conformation. This 

20 is useful at this stage, because conformational 

21 structures can inhibit the detection of tag domains 

22 if they are not presented properly on the surface of 

23 the protein. This can occur as easily with both 

24 soluble and insoluble protein. 



12 
13 
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For example, after growth on the micro-titre plates 
is complete, the plate is centrifuged at 4000 rpm 
28 for 10 minutes at 4«C to harvest the bacterial 
^ ceXIsT me supernatant is removed and the ceTT 



30 pellets are re-suspended in SO ul lysis buffer (10 

31 mM Tris.HCl, pH 9.0, lmM BDTA, 6 mM MgCl a > 

32 containing benzonaee {1 ul/ml) . The plate is 
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1 subsequently incubated at 4°C with shaking for 30 

2 minutes- A sample (10 nl) of the cell lysate is 

3 added to 100 \xl buffer (8 M urea, 500 mM WaCI, 2 0 mM 

4 sodium phosphate, pH 8.0) and incubated at room 

5 temperature for 20 minutes. Samples are then. 

6 applied to a BioDot apparatus (BioRad) containing 

7 nitrocellulose membrane (0.45)iM pore size) in 

8 accordance with the manufacturers' instructions. 

9 The membrane is removed and transferred into 

10 blocking reagent (3% w/v; Bovine serum albumin in 

11 TBS) for 30 minutes at room temperature. The blot 

12 is washed briefly with TBS then incubated in a 

13 primary antibody, specific to the tag being used for 

14 the subset of expression clones. Depending on the 

15 nature of the primary i.e., whether or not it has a 

16 horse radish peroxidase (HRP) reporter function, 

17 will depend on whether the use of a secondary is 

18 required. For detection of specific binding the 

19 membrane is then washed 2x 5 minutes in TBS followed 

20 by ix 5 minute wash in 10 mM Tris*HCl pH7.6, 

21 Detection of specifically bound antibody is 

22 disclosed by the addition of chromogenic substrate 

23 {6 mg diaminobenzidine in 10 ml 10 mM Tris.HCl pH 

24 7.6 containing 50 ill 6% H 2 O a ) • The reaction is 

25 stopped by thorough rinsing in water. Positive 

26 clones identified by this procedure can then be 

27 confirmed by DNA sequencing of the expression 

28 construct using now industry-standard techniques and 

29 equipment such as ABI and Amersham Biosciences. 
30 

3 1 Sequencing 

32 
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The sequencing reactions may be performed using 
techniques common in the art using any suitable 
apparatus. For example, sequencing may be performed 
on the cloned inserts, using the Big Dy e Terminator 
cycle sequencing kits (Applied Biosystems, 
Warrington, UK) and the specific sequencing primer 
run on a Peltier Thermal cycler model PTC225 <MJ 
Research Cambridge, Mass) . The reactions may be run 
on Applied Biosystems - Hitachi 3310 Sequencer 
according to the manufacturer's instructions. These 
sequences are checked to ensure that no PGR 
generated errors have occurred. 

AsseBsment of Solubility of Positive Clones 

The cells of positive clones may be harvested and 
soluble and insoluble protein detected. 

Any suitable techniques known in the art can be used 
to separate soluble and insoluble protein, such as 
the use of eentrifugation, magnetic bead 
technologies and vacuum manifold f titrations. 
Typically, however, the separated proteins are 
ultimately analysed by acrylamide gel and western 
blotting. This confirms the presence of recombinant 
protein at the correct size. 

in one embodiment, contents of each well in the 96 



29 
30 
31 
32 



well plate are transferred into a Millipore 0.65 um 
multi -screen plate. The plate is placed on a vacuum 
manifold and a vacuum is applied. This draws off 
the culture medium to waste. The cells are then 
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1 washed with PBS (optional) , again the vacuum is 

2 applied to remove the PBS. The multi-screen plate is 

3 removed from the manifold and bacterial cell lysis 

4 buffer (containing DNAse) (50 is added to each 

5 well. The plate is incubated at room temperature 

6 for 30 minutes with shaking to facilitate lysis of 

7 the cells, A fresh 96 well microtitre plate (ELISA 

8 grade) is placed inside- the vacuum manifold and the 

9 multi-screen plate is placed above it. When a 

10 vacuum is applied the contents of each well are 

11 drawn into the micro- titre plate below. The vacuum 

12 only needs to be applied for 20 seconds. The 

13 collected lysate contains the soluble fraction of 

14 expressed protein, A. sample of the collected lysate 

15 may subsequently analysed by SDS-PAGE and Western 

16 blotting to confirm both the presence and correct 

17 molecular weight of the target protein. 
18 

19 The use of SDS-PAGE and Western blotting can be 
2 0 expensive and time consuming/ especially when 

21 numerous samples must be analysed for each 

22 construct. In light of this we have developed a 

23 protocol whereby one gel can be used for both total 

24 protein staining and western blotting. This 

25 represents a significant improvement in this 

26 methodology and obviously allows cost saving, and 

27 precise comparisons can be made with regard to total 

28 protein and western blotting as both sets of results 

29 come from the one gel, 
30 

31 The basis of this protocol is in the ability to use 

32 chloroform and UV light to stain protein on an SDS- 
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1 PAGE gel (Kazmin efc al., Anal Biochem, 2001, 301(1) 

2 91-6; doi 5 10.1006/abio.2001.5488). We have used 
this technique to great effect as it allows for the 
extremely rapid staining of a SDS-PAGE gel in less 
than a tenth of the time taken using other more 
traditional staining methods such as Coramassie 

7 Brilliant Blue and Collodial Blue stains. We then 

8 decided to take this observation a step further and 
analyse the ability of a chloroform- stained gel to 
be used in Western blotting. This would not be 
expected to work as other stained gels result in the 
fixing of the protein to the gel and subsequent 
inability to transf er- the protein during blotting. 
This expectation is coupled to the fact that 
chloroform is not compatible with western blotting 
equipment <Bio-Rad SD blotter user's manual) . 
However, fortuitously, we have discovered that with 

18 a wash of the chloroform-stained gel in double- 
distilled water, to remove excess chloroform, and 
after subsequent soaking in transfer buffer, 
proteins were effectively transferred during western 

22 blotting in contrast to expectations. This transfer 

23 was no-less effective than from a gel that has not 

24 been pre-stained with chloroform and OV light. 

25 Figure 6 primarily shows results relating to the 
production of soluble protein by the platform, but 
also shows the ability to use the chloroform-stained 
SDS-PAGE derived western blot for the identification 
of proteins, without any apparent damage caused to 

30 the proteins. 
31 
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1 The use of a chloroform- stained SDS-PAGE derived 

2 western blot for the identification of proteins 

3 forms another aspect of the present invention. 
4 

5 Scale-Up and Purification 
6 

7 This analysis provides a picture of the expression 

8 status of the clones on each plate. Using this 

9 analysis, positive soluble protein expressing clones 

10 can be identified for the production of soluble 

11 recombinant protein for a given target protein. The 

12 clones may be selected and their growth scaled up 

13 e.g. to 5 ml scale, using the saved primary plate as 

14 an inoculum. Parameters that may be taken into 

15 consideration in deciding on the appropriate culture 

16 to select for scale -up include the desirability of 

17 specific regions for the production of an antigen, 

18 the overall expression levels of the clone and 

19 factors that may affect affinity purification such 

20 as amino acid composition. 
21 

22 Example 1. Expression construct design 
23 

24 Figure 1 is a diagrammatic representation of the 

25 protein Jakl. Using pfam, the position of distinct 

26 domains was established. Further analysis of these 

27 domains was then carried out using Tcopred and the 

28 Kyle and Dolittle hydrophobicity algorithm to 

29 determine the usefulness of these domains as soluble 

30 antigens. From this tentative analysis, four 

31 domains were selected for amplification and 

32 expression analysis. Based on this preliminary in 
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1 Bilico analysis, primers specific for a target 

2 protein were designed and used to amplify domains 

3 selected for analysis. 

4 

5 Vectors {500 ng) were restricted with BaraHI (20 

6 units) and Sail (20 units) in the presence of calf 

7 intestinal alkaline phosphatase (CIP) (2 units), gel 

8 purified and quantified using standard methods . 

9 Purified PCR fragments (100 ng) were restricted with 

10 BamHj (5 units) and Sail 5 units), gel purified, 

11 quantified, and then used in a ligation reaction 

12 with the restricted vector again using standard T4 

13 DNA ligase methods (Ready-to-Go T4 DNA ligase,. 

14 Amersham Biosciences) . A sample of the ligation 

15 reaction (1 ul) was then used to transform the 

16 appropriate competent bacterial cells (TOP10F' were 

17 used here for the pQB based vectors, a modification 

18 of the manufacturers recommendations; BL21(DE3)pLysE 

19 for pET43.1a and TOF10F' for pGEX-Pus) . 

20 Transformers were selected on LB/arapicillin (100 

21 ug/ml) overnight at 28*0. 



A Cambridge BioRobitics BioPick instrument was used 
for the picking of 24 colonies from each of the 

25 transformant plates into flat -bottomed and lidded 

26 micro-titre plates. The clones were used to 

27 inoculate 150 ul of LB (containing lOOug/ml 

28 ampicillin) , and these were allowed to grow 

29 overnight at 37 °C. 



31 A secondary plate was prepared hy the inoculation of 

32 200 ul of LB containing the required supplements 
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1 with 10 Til of the overnight primary culture. These 

2 were then grown at 37 °C Once an optical density 

3 (OD) of 0.25 at A550 was reached, IPTG (final 

4 concentration, 1 mM) was added to induce expression 

5 of the recombinant protein- Culture propagation was 

6 continued for another 4 hours prior to harvesting of 

7 bacterial cells. 
8 

9 After clones expressing specific recombinant protein 

10 have been identified, the solubility of these 

11 proteins has to be established prior to clone 

12 selection for purification. This can be performed a 

13 number of ways including the use of centrif ugation 

14 and automat ion- friendly vacuum manifold separations. 

15 The results here were obtained using methodologies 

16 based around the use of vacuum-assisted filtration 

17 to separate soluble and insoluble protein. The 
Id filtrates that were produced from the method 

19 described were then analysed by SDS-PAGE and Western 

20 blotting to confirm the production of a recombinant 

21 protein of the correct anticipated molecular weight. 
22 

23 Example 2 Design and Construction of SNOT Expression 

24 Tag 
25 

26 Based on analysis of the amino acid sequence and 

27 predicted structure of SrtAau, it was decided to 

28 amplify the region of amino acids 26 to 171 of the 

29 SrtA sequence. Amplification was conducted using 

30 the forward primer 5' TTTTTTAGATCTAAACCACATATCGAT 

31 and the reverse primer 5' 

32 TTTTTTGGATCCATCTAGAACTTCTAC . This product was then 
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1 digested with BglT and BamHI and ligated into pQE30 

2 vector which had also been digested with BairiKl to 

3 form the pSNDT vector. The ligation mix was 

4. transformed into TOP10F' cells and single colonies 

5 propagated on LB agar containing 100 ug/ml 

6 ampicillin. Clones with the artA fragment in the 

7 correct orientation were screened by expression 

8 analysis and positive clones identified using the 

9 denaturing dot-blot assay described earlier. 
10 

11 The sequence encoding the SMUT tag was cloned into 

12 pQB30 as described earlier and positive clones 

13 identified by denaturing dot blots, SDS-PAGE and 

14 Western blotting. Final confirmation of these 

15 ; clones was provided by DNA sequencing, and the 

16 sequence of the multiple cloning region of the 

17 resultant vector is shown in Figure 4. Variances in 
the sequence of the SNOT domain were observed from 
the sequence for SrtA that has been logged in 
Genbank (AF1626S7) . The variances are (using the 
annotation of AF162687) nucleotide 604 AAG causing 

22 an amino acid mutation of KAR; nucleotide 647 RAG, 

23 codon remains K, therefore a silent mutation; 

24 nucleotide 982 OAA causing an amino acid mutation of 

25 GAE. 
26 

27 Example 3 Trials of SHOT Expression Constructs 
28 



16 
19 
20 
21 



29 
30 
31 



Target inserts were cloned into the pSNUT vector 
using primer construction and digestion of resulting 
PCR amplifications with BamHI and Sail as described 
32 earlier. pSNUT was digested with BamHI in a similar 
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1 manner and the target inserts cloned as described* 

2 Clones were screened using the denaturing dot -blot 

3 system and then analysed with SDS-PAGE and western 

4 blotting. Positive clones were used fox preparative 

5 200 ml LB cultures containing 100 lig/ml ampicillin 

6 and induced as described earlier- This was grown to 

7 an optical density of 0.5 at Asso at 37 °C. 

8 Expression of SNDT was then induced with the 

9 addition of IPTG {final concentration, 1 triM) and 

10 left to grow ^f or another 4 hours. Cells were then 

11 harvested by centrifugation at 5K rpm for 15 

12 minutes. Cells were re- suspended in 30 ml PBS 

13 containing 0.1% Igepal and lysis induced by two 

14 freeze- thaw cycles. The suspension was then 

15 sonicated and centrifuged at 5K rpm for 15 minutes. 

16 The soluble supernatant was transf erred to a fresh 

17 container and filtered through a 0.8 |Xm disc filter 

18 to remove final cell debris. This solution was then 

19 applied to a Ni 2+ charged IMAC column (Amersham 

20 Biosciences HiTrap Chelating column/ 1 ml) using an 

21 AKTA Prime low pressure chromatography system and 

22 column was then treated using a standard native his- 

23 tag purification protocol involving washing of 

24 column with 20 mM sodium dihydrogen phosphate pH 8.0 

25 containing 10 mM imidazole , 500 mM NaCl, and elution 

26 of soluble his-tagged proteins using 20 mM sodium 

27 dihydrogen phosphate pH 8.0 containing 500 mM 

28 imidazole, 500 mM NaCl . Elution fractions were then 

29 analysed on an SDS-PAGE gel (4-20% SDS-PAGE Bio-Rad 

30 Criterion gel) , which was stained with chloroform as 

31 described earlier. This gel was then subsequently 

32 western blotted and the his-tagged protein detected 
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1 with anti-poly-histidine monoclonal antibody using 

2 the techniques described herein. 
3 

4 Preliminary trials and native purification showed 

5 that the SNUT fragment was very soluble and its 

6 characteristics were in no way diminished by 

7 truncation, thus showing that SHUT could represent a 

8 useful tag domain (data not shown) . To fully test 

9 the abilities of SNUT, we then chose two proteins 

10 for which soluble protein production had proved 

11 impossible using the other expression systems in 

12 which SHOT was not used as a tag. These were murine 

13 MAK.1 and human Jakl. Clones were prepared and 

14 selected using the method as described in the 

15 Examples above and positive clones were subsequently 

16 grown and induced at 37 °C. These were then treated 

17 to identical native histag purifications. Both 

18 proteins behaved very favourably under standard 

19 purification conditions as can be seen from the 

20 purification profiles in Figure 5. For both these 

21 trial proteins, this was the first example of such 

22 purification under soluble conditions. The 

23 production of these proteins using conventional 

24 techniques has failed to produce any soluble 

25 protein, irrespective of expression system or growth 

26 conditions used (data not shown) . However, as 

27 described in this example, when the protein 

28 fragments were expressed in pSKDT, soluble proteins 

29 can be surprisingly obtained. 
30 

31 The effectiveness of SNDT as a fusion protein is 

32 even more significant when it is considered that no 
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1 special growth conditions were required for the 

2 generation of soluble protein. This is remarkable 

3 when one considers the protein expressionist's 

4 standard GST tag which is not even soluble itself 

5 when expressed at 37 *C; 28 °C is required before 

6 even the generation of GST on its own without any 

7 ■ target protein is observed. 
8 

9 Example 4 Purification of CD33 fragments using SNOT 
10 Expression Constructs 
11 

12 Cleaning Results 
13 

14 CD33 contains two extracellular immunoglobulin 

15 -domains. The extracellular region of the CD33 DNA 

16 sequence had been cloned into several vectors for 

17 expression/ including expression as a fusion tag to 

18 DHFR and NusA. none of these vectors produced 

19 recombinant CD 3 3 protein. The CD33 extracellular 

20 region was also cloned into pSNOT. Both pSNOT and- 

21 CD 3 3 were restricted with BamHl and Hindlll under 

22 standard conditions and ligated together using T4 

23 DNA ligase, again under standard manufacturer' s 

24 protocols. TOP10F' cells were transformed with the 

25 ligation product. 
26 

27 S colonies were picked from the transformation plate 

28 and grown in 150ul LB in a 96-well plate at 37 "C 

29 overnight 
30 

31 Expression analysis t 
32 
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The overnight cultures were used to inoculate fresh 
LB cultures <10ul into 190ul LB + 50yg/ml 

3 ampicillin) and grown at 37°C for 2 hours. 

4 Expression of the SNUT-CD33 construct was induced 

5 with luiM IPTG. 
6 

Cells were pelleted after 4 hours and lysed in PBS + 
0.1% igepal. Lysates were analysed by western blot 
9 using anti-histag antibody. As shown in Figure 6, 
it was clear that colonies 1, 3 and 4 were positive 
11 and 2 was not (SNOT only) . 
12 

13 Large Scale Expression; 
14 

-The clone pertaining to lane 1 of Figure 6 was 
chosen for sequencing analysis, which proved 

17 successful insertion into the pSMUT vector. This 

18 clone was grown in large scale {200ral) for 

19 expression of the SNUT-CD33 construct at 37°C. 

20 Expression was induced whenever the OD600=0.4-0.6. 

21 After 4-6 hours expression, the cells were pelleted 

22 and lysed in 8M urea buffer. Lysates were clarified 

23 and purifed by immobilised metal affinity 
chromatography (IMAC) using a re-folding technique 
of decreasing urea concentration. At 0M urea, the 

26 SNUT-CD33 was eluted from the IMAC column and 

27 analysed by SDS PAGE using Coomassie blue stain 

28 (Figure 7A) and Western Blotting (Figure 7B) usin 3 

29 anti-histag antibody. 
30 

31 Antibody Detection of expressed protein: 

32 
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1 The SMUT fusion protein contains an N-terminal His- 

2 tag. This facilitates detection using commercially 

3 available ant i -His antibodies, and can be used as a 

4 means for purification of the recombinant protein 

5 via IMAC as described (see Figure 8a) . 
6 

7 In addition, we have developed in-house a polyclonal 

8 antibody against SNUT and it also provides a 

9 detection and purification means, as demonstrated in 
10 Figure-- 8b. 

11 

12 Results : 
13 

14 CD33 has been a very difficult protein to express. 

15 The most desirable part of the protein for antigen 

16 production is the extracellular variable domain. 

17 There are two immunoglobulin domains in the 

18 extracellular region of CD33, a membrane distal 

19 variable (IgV) domain and a membrane proximal 

20 constant (C2) domain. Expression analysis bad been 

21 performed for three fragments of the extracellular 

22 region: the variable domain, the constant domain and 

23 the full extracellular region in a number of 

24 commercially available expression vectors. Only the 

25 constant domain fragment would express in any of the 

26 vectors. In order to express the desired variable 

27 domain, the full length extracellular fragment and 

28 the IgV domain fragment were cloned into our pSNOT 

29 vector. Expression was successful for the full 

30 length fragment. 
31 
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1 The full length fragment was also purified 

2 successfully by re-folding on an 1MAC column. Not 

3 only has the pSNDT vector allowed us to express a 

4 protein fragment that has been unable to be 

5 expressed in any tried commercially available 

6 vector, including vectors with fusion tags designed 

7 to increase expression such as NusA and DHFR, but 
has allowed us to purify the expressed protein using 
immobilised metal affinity chromatography by 
standard techniques, and can be used for detection 
of any protein expressed in the vector using either 

12 anti-His or anti-SrtA antibodies. 
13 

14 All documents referred to in this specification are 

15 herein incorporated by reference. Various 

16 modifications and variations to the described 

17 embodiments of the inventions will be apparent to 
those skilled in the art without departing from the 
scope and spirit of the invention. Although the 

20 invention has been described in connection with 

21 specific preferred embodiments, it should be 

22 understood that the invention as claimed should not 

23 be unduly limited to such specific embodiments. 

24 Indeed, various modifications of the described modes 

25 of carrying out the invention which are obvious to 

26 those skilled in the art are intended to be covered 

27 by the present invention. 
28 
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1 Claims 



2 

3 l. Use of a sortase gene product as a purification 

4 tag. 
5 

6 2. Use of a sortase, e.g srtA, gene product as an 

7 immunogen . 
8 

9 3. The use according to claim 1 or claim 2 wherein 

10 the sortase gene product is a Staphylococcus 

11 aureus srtA gene product. 
12 

13 4 . The use according to any one of claims 1 to 3 

14 wherein the sortase gene product is encoded by 

15 the nucleotide sequence shown in Figure 4 or a 

16 variant or fragment thereof. 
17 

10 5. The use according to any. one of claims 1 to 4 

19 wherein the sortase gene product comprises 

20 amino acids 26 to 171 of the SrtA sequence 

21 shown in Figure 4 or a variant or fragment 

22 thereof. 
23 

24 6. An expression construct for the production of 

25 recombinant polypeptides/ which construct 

26 comprises an expression cassette consisting of 

27 the following elements that are qperably 

2 3 linked: a) a promoter; b) the coding region of 

29 a DNA encoding a sortase gene product as a 

30 purification tag sequence; and c) a cloning 

31 site for receiving the coding region for the 

32 recombinant polypeptide to be produced; and d) 
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transcription termination signals/ 

7. The expression construct according to claim 6 
wherein the sortase gene product is a 
Staphylococcus aureus srtA gene product • 

8. The expression construct according to claim 6 
or claim 7 wherein the sortase gene product is 
encoded by the nucleotide sequence shown in 
Figure 4 or a variant or fragment thereof. 

9. The expression construct according to any one 
of claims 6 to 8 wherein the sortase gene 
product comprises amino acids 26 to 171 of the 
SrtA sequence shown in Figure 4 or a variant o: 
fragment thereof. 

10. A method for producing a polypeptide, 
comprising: 

a) preparing an expression vector for the 
polypeptide to be produced by cloning the 
coding sequence for the polypeptide into the 
cloning site of an expression construct as 
claimed in any one of claims 6 to 9; 

b) transforming a suitable host cell with the 
expression construct thus obtained; and 

c) culturing the host cell under conditions 
allowing expression of a fusion polypeptide 
consisting of the amino acid sequence of the 
purification tag with the amino acid sequence 
of the polypeptide to be expressed covalently 
linked thereto,- and d) isolating the fusion 
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1 polypeptide from the host cell or the culture 

2 medium by means of binding the fusion 

3 polypeptide present therein through the amino 

4 acid sequence of the purification tag, 
5 

6 11. The method according to claim 10 , wherein the 

7 sortase gene product is a Staphylococcus aureus 

8 srtA gene product. 
9 

10 12. The method according to claim 10 or claim 11 

11 wherein the sortase gene product is encoded by 

12 the nucleotide sequence shown in Figure 4 or a 

13 variant or fragment thereof. 



The method according to any one of claims 10 to 
12 wherein the sortase gene product comprises 
amino acids 26 to 171 of the SrtA sequence 
shown in Figure 4 or a variant or fragment 
thereof , 



21 14. A fusion polypeptide obtained by the method of 

22 any one of claims 10 to 13 * 
23 

-24 15. A purification tag comprising a sortase gene 
25 product . 

26 
27 

28 16. ' The purification tag according to claim 15 

29 wherein the gene product is a Staphylococcus 

30 aureus srtA gene product. 



31 
32 
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16 
17 
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1 17. The purification tag according to claim 15 or 

2 claim 16 wherein the sortase gene product is 

3 encoded by the nucleotide sequence shown in 
Figure 4 or a variant or fragment thereof. 



4 
5 

6 18. The purification tag according to any one of 

7 claims 15 to 17 wherein the sortase gene 

8 product comprises amino acids 26 to 171 of the 

9 SrtA sequence shown in Figure 4 or a variant or 
10 fragment thereof. 

11 

12 19. A method of inducing and/or enhancing an immune 

13 response to an antigen of interest, the method 

14 comprising administering the antigen of 

15 interest with a sortase, e.g srtA, gene 

16 product . 
17 

18 20. The method according to claim 19, wherein the 

19 sortase gene product is a Staphylococcus aureus 

20 srtA gene product. 
21 

22 21. The method according to claim 19 or claim 20 

23 wherein the sortase gene product is encoded by 

24 the nucleotide sequence shown in Figure 4 or a 

25 variant or fragment thereof. 
26 

27 22. The method according to any one of claims 19 to 

28 21 wherein the sortase gene product comprises 

29 amino acids 26 to 171 of the SrtA sequence 

30 shown in Figure 4 or a variant or fragment 

31 thereof. 
32 
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Figure 6 
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Figure 8A Figure 8B 
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